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^nopa^y- assoc iated- virus (LAV) a^ p; , MS&^3 ; f^ tt UHk 
said LAV aanQmic RNA^ _^jg^^j§j^BB& 

~flv fiTU9 invention relates to cloned DNA sequences 
indistinguishable from genomic RNA and DNA of lympha- 
denopathy-associated virus (LAV), a process for their 
preparation and their uses. It relates more particularly 
to stable probes including a DNA sequence which can be 
used for the detection of the LAV virus or related viruses 
or DNA proviruses in any medium, particularly biological 
samples containing any of them. The invention also relates 
to polypeptides, whether glycosylated or not, encoded by 
said DNA sequences. 

Lymphadenopa thy-associa ted virus (LAV) is a human 
retrovirus first isolated from the lymph node of a homo- 
sexual patient with lymphadenopa thy syndrome, frequently a 
prodrome or a benign form of acquired immune deficiency 

-nave 


syndrome (AIDS). Subsequently, other LAV isolates «ha ve bw rn 
recovered from patients with AIDS or pre-AIDS. All availa- 
ble data are consistent with the virus being the causative 
agent of AIDS. 

A method for cloning such DNA sequences has alrea- 
dy been disclosed in British Patent Application Nr. 
84 23659, filed on September 19, 1984. Reference is here- 
after made to that application as concerns subject matter 
in common with the further improvements to the invention 
disclosed herein. 

The- present invention aims at providing additional 

/2A^2 

means which ^ shoul d not only -als - o be J useful for the 

•*4 


new 

detection of LAV or related viruses, (hereafter more 
generally referred to as "LAV viruses/), but alsq^ 


have 


4^ 


more versa tility . particularly in detecting specific parts 
of the genomic JQMc of said viruses whose expression pro- 
ducts are not always directly detectable by immunological 
methods. 
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invention further aims at providing 
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polypeptides containing sequences in common with polypep- 
tides encoded by the LAV genomic RNA. It relates even more 
particularly to polypeptides comprising antigenic deter* 
minants included in the proteins encoded and expressed by 

5 the LAV genome occuring in nature. An additional object of 
the invention is to further provide means for the 
detection of proteins related to LAV virus, particularly 
for the diagnosis of AIDS or pre-AIDS or. to the contrary, 
for the detection of antibodies against the LAV virus or 

to proteins related therewith. particularly in patients 
afflicted with AIDS or pre-AIOS or more generally in 
asymtomatic carriers and in blood- related products. 
Finally^ the invention also aims at providing immunogenic 
polypeptides, and more particularly protective 

15 polypeptides for use in the preparation of vaccine 

compositions against AIDS or related e^ wilroi i iM " . — 

The present invention relates to additional ONA 
fragments, hybridizable with the genomic RNA of LAV as 
they will be disclosed hereafter, as well as with additio- 

20 nal cDNA variants corresponding to the whole genomes of 
LAV viruses. It further relates to ONA recombinants con- 
taining said DNAs or cDNA fragments. 

The invention relates more particularly to a cONA 
variant corresponding to the whole of LAV retroviral 

25 genomes, which is characterized by a series of restriction 
sites in the order hereafter (from the 5' end to the 3' 
end ) . 

The coordinates of the successive sites of the 
whole LAV genome (restriction map) are indicated hereafter 
30 too. with respect to the Hind III site (selected as of 
coordinate 1) which is located in the R region. The 
coordinates are estimated with an accuracy of i 200 bp : 

Hind III 0 

Sac I 50 
35 Hind III 520 

Pst I 300 

Hind III 1 100 
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8gl tl 

1 

500 

Kpn I 

3 

500 

Kpn I 

3 

900 

Eco RI 

4 

100 

Eco RI 

5 

300 

Sal I 

5 

500 

Kpn I 

6 

100 

Bgl II 

6 

500 

Bgl II 

7 

600 

Hind III 

7 

850 

Bam HI 

8 

150 

Xho I 

8 

600 

Kpn I 

8 

700 

Bgl II 

8 

750 

Bgl II 

9 

150 

Sac I 

9 

200 

Hind III 

9 

250 


Another ONA variant according to this invention 
optionally contains an additional Hind III approximately 
20 at the 5 550 coordinate. 

Reference is further made to fig. 1 which shows a 
more detailed restriction map of said whole/ONA (AJ19). 

An even more detailed^ nuc le ot idi e ^ sequence of a 
preferred ONA according to the invention is shown in^f^j*. 
25 4-12 hereafter. 

The invention further relates to other preferred 
ONA fragments which will be referred to hereafter. 

Additional features of the invention will appear 
in the course of the non-limitative disclosure of additio- 
30 nal features of preferred ONAs of the invention, as well 
as of preferred polypeptides according to the invention. 
Reference will further be had to the drawings in which : 

fig. 1 is the restriction map of a complete LAV genome 
(clone AJ19) ; 

35 - figs. 2 and 3 show diagrammatically parts of the three 
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possible reading phases of LAV genomic RNA, including the 
open reading frames (ORF) apparent in each of said reading 
phases ; 


figs. 4-12 show the turrftnivfl ntrelantirtir iflqunnrfti of 
a complete LAV genome. The possible pa^i d S sequences in 

rela ^i^LcJ^^6^ 8 three p0SSiDle rea di°9 phases related to 
the nuolo o ti d i e sequences shown are also indicated : 

figs. 13-18 reiterate the sequence of part of the LAV 
genome containing the genes coding for the^e nv elo pfeo pro- 
10 teins, with particular boxed ■p ' e p^i d i fg sequences which cor- 
respond/ to groups which normally carry glycosyl groups. 

The sequencing and determination of sites of par- 
ticular interest j*a-s- carried out on a phage recombinant 



corresponding to AJ19 disclosed in the abovesaid British 
15 Patent application Nr. 64 23659. A method for preparing it 
is disclosed in that application. 

The whole recombinant phage ONA of clone XJ19 
(disclosed in the earlier application) was sonicated 
according to the protocol of DEININGER (1983). Analytical 
20 Biochem. 129, 216. ^LtLS. ONA was repaired by a Klenow 
^ » reaction for 12 hours at 16*C. The ONA was electrophoresed 

through 0.8 I agarose gel and ONA in the size range of 
300-600 bp was cut out and electroeluted and precipitated. 
Resuspended ONA (in 10 mM Tris, pH 8 ; 0,1 mM EDTA ) was 
25 ligated into M13mp8 RF ONA (cut by the restriction enzyme 
' ^C^- __Sma I and subsequently alkaline phosphated), using T4 0NA- 

^ and RNA-ligases (Maniatis ^ et al (1982) - Molecular 

cloning - Cold Spring Harbor Laboratory). An £. coli 
strain designated as TG1 was used for further study. This 
30 strain has the following genotype : 

0 

Alac pro, supE, thi . F ' tr a 036 , proAB , lad , ZAMIS.r 

This JL. coli TGI strain has the peculiarity of 
enabling recombinants to be recognized easily. The blue 
colour of the cells transfected with plasmids which did 

35 
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not recombine with a fragment of LAV ONA is not modified. 
To the contrary cells transfected by a recombinant plasmid 
containing a LAV ONA fragment yield white colonies. The 
technique which was used is disclosed in Gene ( 1 983 ), 26 . 
5 101 . 

This strain was transformed with the ligation mix 
using the Hanahan method (Hanahan 0 (1983) J. Mol. Biol. 
166, 557). Cells were plated out on tryptone- agarose plate 
with IPTG and X-gal in soft agarose. White plaques were 

10 either picked and screened or screened directly jjQ, situ 
using nitrocellulose filters. Their DMAs were hybridized 
with nick-translated^ONA inserts of pUC18 jjinj III 
subclones of AJ19. ^ h ia permitted the isolation of the 
plasmids or subclones of A which are identified in the 

15 table hereafter. In relation to this table it should also 
be noted that the designation of each plasmid is followed 
by the deposition number of a cell culture of £. coli TGI 
containing the corresponding plasmid at the "Collection 
Nationale des Cultures de Micro- organismes " (C.N. CM.) of 

20 the Pasteur Institute in Paris, France. A non-transformed 
TGI cell line was also deposited at the C.N. CM. under Nr. 
1-364. All these deposits took place on November 15, 1984. 
The sizes of the corresponding inserts derived from the 
LAV genome have also been indicated. 


TABLE 

Essential features of the recombinant plasmids 


pJ19 - 1 plasmid 


( 1-365) 


0.5 kb 


Hind III - Sac I - Hind III 


pJ19 


17 plasmid 


( 1-367 ) 


0.6 kb 


Hind III - Pst 1 - Hind III 


pJ19 - 6 plasmid 


(1-366) 


1 . 5 kb 


Hind III (5) 

Bam HI 

Xho I 

Kpn I 

8gl II 

Sac I (3* ) 

Hind III 

PJ19-13 plasmid (1-368) 6.7 kb 

Hind II r (5*) 

8gl II 

Kpn I 

Kpn I 

eco RI 

Eco RI 

Sal. I 

Kpn I 

891 II 

Bgl II 

Hind III (3* ) 


7 


Positively hybridizing M13 phage plates were grown 
up for 5 hours and the single-stranded ONAs were 
extracted . 

M13mp8 subclones of XJ19 DNAs were sequenced 
according to the dideoxy me,thod and technology devised by 
Gangor at al (Sanger et alj(1977) ( Proc. Natl. Acad. Sci. 
USA . J± , 5*63 and tM3 cloning and sequencing handbook, 


AMERSHAM ( 1 983 ). JfeHTS" 17-mer oligonucleotide primer 


35, 


a- SdATP (400Ci/mmol. AMERSHAM ) , and 0.5X-5X buffer 
10 gradient gels (Biggen M.O. et al ( 1983], Proc. Natl. Acad. 
Sci. USA, 5_Q., 3963) were used. Gels were read and put into 
the computer under the programs of Staden (Staden R. 
(1982), Nucl. Acids Res. 1 0 . 4731). All the appropriate 
references and methods can be found in the AMERSHAM M13 
15 cloning and sequencing handbook. 

The complete sequence of AJ19 was deduced from the 
experiments as further disclosed hereafter. 


Figs. 4-12 provide the ONA r\\^TSnx£dl o sequence of 
the complete genome of LAV. The numbering of the 
20 nucleotides starts from a left most Hind III restriction 
site (5'AAG..,) of the restriction map. The numbering 
occurs in tens whereby the last zero number of each of the 
numbers occuring on the drawings is located just below the 
nucleotide corresponding to the nucleotides designated. 
25 ^r^f^the nucleotide at position 10 is T, the nucleotide at 
position 20 is C, etc.. 

j i Above each of the lines of the successive nucleo- 

-ttffte sequences there are provided three lines of single 
^letters corresponding to the ^mS ^oi^tS ^sequence deduced 
30 from the ONA sequence (using the genetic code) for each j£r 
the three reading phases, whereby said single letters have 
the following meanings. 
A : alanine 
R : arginine 
35 K : lysine 

H : histidine 
C : cysteine 
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M : methionine 
W : tryptophan 
F : phenylalanine 

Y : tyrosine 
L : leucine 

V : valine 

I : isoleucine 
<3 : glycine 
T : threonine 
S : serine 
E : glutamic acid 
0 : Aspartic acid 
N : asparagine 
Q : glutamine 
P proline. 

The asterik signs correspond to stop codons 

(i.e. TAA , TAG and TGA) . 

.Starting above the first line of the DNA 
- ^at e xa o ti^iO ' sequence of fig. 4, the three reading phases 

-4 

are respectively marked "1", "2", "3", on the left 
handside of /jthe drawing. The same relative presentation of 
the three \^Sr2^?cal. reading phases is then used^a^ft^ over- 
the successive? lines of the LAV sequence . 

Figs. 2 and 3 provide a diagrammatized represen- 
tation of the lengths of the successive open reading 
frames corresponding to the successive reading phases 
(also referred to by numbers "1", "2" and "3" appearing in 
the left handside part of fig. 2/. The relative positions 
of these open reading frames (ORF) with respect to the 
^2k cj^^^iV structure of the LAV genome is referred to by 
the scale of numbers representative of the respective 
positions of the corresponding nucleotides in the DNA 
sequence. The vertical bars correspond to the positions of 
the corresponding stop codons. 
1 ) The "gag gene" (or ORF-oaa) 

The "gag gene" codes for core proteins. 
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Particularly it appears that a genomic fragment (ORF-gag) 
thought to code for the core antigens including the p2 5 , 
p16 and p13 proteins is located between Jffffcieo^ia^U' 
position 236 (starting with 5' CTA GCG GAG 3') and 
^UG^oot t ^i e- position 1759 (ending by CTCG TCA CAA 3'). The 
structure of the peptides or proteins encoded by parts of 
said ORF is deemed to be that corresponding to phase 2. 


The methionine -a mi n 6a o id 7 M" coded by the ATG at 
position 260-262 is the probable initiation methionine of 
to the gag protein precursor. The end of ORF-gag and 
accordingly of gag protein appears to be located at 
position 1759. 

The beginning of p25 protein, thought to start by 

vcuA&> 

a P- I -V-Q-N- I -Q-G-Q-M-V-H .... j> m i n o a c * rd sequence is 



15 thought to be coded for by the Tu i ol eo t id i o sequence 
CCTATA . . . , starting at position 656. 

Hydrophilic peptides in the gag open reading frame 
are identified hereafter. They are defined starting from 
a m Sy o aei d 1 = Met (M) coded by the ATG starting from 260-2 

20 in the LAV ONA sequence. 


Those hydrophilic peptides are 

12-32 $n7r(^rfitf^ nr 1 ,,e i t,Q 
i 


37-46 
49-79 

25 88-153 
158-165 
178-188 
200-220 
226-234 

30 239-264 
288-331 
352-361 
377-390 
399-432 

35 437-484 
492-498 
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The invention also relates to any combination of 
these peptides. 

2) The -ool aene" (or ORF-ool) 

Figs. 4-12 also, show that the ONA fragments 
5 extending from mjcT eo tidio position 1555 (starting with 
5'TTT TTT ....3' to ^uolootid i- o position 5086 is thought 
to correspond to the pol gene. The polypeptidic structure 
of the corresponding polypeptides is deemed to be that 
corresponding to phase 1. It stops at position 4563 (end 
10 by 5 * G GAT GAG GAT 3 ' ) . 


5670 (starting with 5 ' AAA GAG GAG A .... 3 ' ) up to nucleo- 
•*4SXc position 8132 (ending by ....A ACT AAA GAA 3'). 
V o ^l^ i ^ i ^iQ ^ i e structures of sequences of the envelope 


protein correspond to those read according to the "phase 
20 3" reading phase. 


Additional feature* of the envelope protein coded 
by the env genes appear on figs. 13-18. These are to be 
25 considered as paired figs. 13 and 14 ; 15 and 16 ; 17 and 
Irrespectively. 

It is to be mentioned that because of format 
d if f icult ies / 

Fig. 14 overlaps to some extent with fig. 13. 
30 Fig. 16 overlaps to some extent with fig. 15^ 

Fig. 18 overlaps to some extent with fig. 17. 
ThuSj for instance, figs, 13 and 14 must be con- 
sidered together. Particularly the sequence shown on the 
first line on the top of fig. 13 overlaps with the 
35 sequence shown on the first line on the top of fig. 14. In 
other words, the starting of the reading of the successive 


15 



the level 
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sequences of the env gene as represented in figs. 13-18 
involves first reading the first line at the top of fig. 
13 then proceeding further with the first line of fig. 14. 
One then returns to the beginning of the second line of 
5 fig. 13, then again further proceed with the reading of 
the second line of page 14, o t c . i : The same observations 
then apply to the reading of the paired figs. 15 and 16, 
and paired figs. 17 and 18. respectively. 

The locations of neutralizing epitopes are further 

10 apparent in figs. 13-18. ^ e f e r e no e is more particularly 
made to £he boxed groups of three letters included in the 
anun^Eiqsequences of the envelope proteins (reading 
phase 3) which can be designated generally by the formula^ 
N-X-S or N-X-T, wherein X is any other possible ^mpTSacid ■. 

15 Thus, the initial protein product of the env gene^^ a gly- 
coprotein of molecular weight in excess of 91,000. These 
groups are deemed to generally carry glycosylated groups. 
These N-X-S and N-X-T aroups wi^fv attached glycosylated 


groups form together h ydf op hy l ie regions of. the protein 
20 and are deemed to be located at the periphery of and to be 
exposed outwardly with respect to the normal conformation 
of the proteins. Consequently, they are considered as being 
epitopes which can efficiently be brought into play in 
vaccine compositions. 
25 The invention thus concerns with more particulari- 

ty peptide sequences included in the env/proteins and 
excizable therefrom (or having the same a min o aoxd struc- 
ture). having sizes not exceeding ZOO^ aminoaeicir . 

Preferred peptides of this invention (referred to 
30 hereafter as a, b. c, d, e, are deemed to correspond to 
those encoded by the nucleotide sequences which extend, 
respectively, between the following positions : 

a) from about 6095 to about 6200 

b) H " 6260 - - 6310 
35 c) " 6390 - " 6440 

d ) " " 6485 " " 6620 
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e) " " 6860 " " 6930 

f) - • 7535 " " 7630 

Other hydrophilic peptides. in the env open reading 
frame are identified hereafter, t h e ? are defined starting 

A 

5 from 

aminoacid 1 3 lysine ( K ) coded by the AAA at position 
5670-2 in the LAV ONA sequence. 

- These hydrophilic peptides are 

8-23 aminoir iris inclusive 

-7 

10 63-78 
82-90 
97-123 

127-183 

197-201 
f5 239-294 

300-327 

334-381 

397-424 

466-500 
20 5tO-523 

551-577 

594-603 

621-630 

657-679 
25 719-758 

780-803 

The invention also relates to any combination of 
these peptides. 
4 ) Jhe other QRF 
30 The invention further concerns DNA sequences which 

provide open reading frames defined as ORF-Q , ORF-R and as 
- 2 - ( -3» j -4- ( -5", the relative position of which 
appears more particularly in figs. 2 and 3. 

These ORFs have the following locations : 
35 ORF-Q phase 1 start 4478 stop 5086 

ORF-R " 2 " 8249 - 8896 


ORF-1 " 1 

ORF-2 " 2 

ORF-3 " 1 

ORF-4 " 2 

ORF-5 1 
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5029 " 5316 

5273 " 5515 

5383 " 5616 

5519 " 5773 

7966 ~ 8279 

The LTR (long terminal repeats) can be defined as 
lying between position 8560 and position 160 (end exten- 
ding over position 9097/1). As a matter of fact the end of 
the genome is at 9097 and, because of the LTR structure of 
10 the retrovirus. links up with the beginning of the 
sequence : 

Hind III 
CTCAATAAAGCTTGCCTTG 


n 


15 9097 1 

The invention concerns more particularly all the 
DNA fragments which have been more specifically referred 
to hereabove and which correspond to open reading frames. 
It will be understood that the man skilled in the art will 

20 be able to obtain them all. far instance by cleaving an 
entire DNA corresponding to the complete genome of a LAV 
species. such as by cleavage by a partial or complete 
digestion thereof with a suitable restriction enzyme and 
by the subsequent recovery of the relevant fragments. The 

25 different DNAs disclosed in the earlier mentioned British 
Application can be resorted to also as a source of sui- 
table fragments. The techniques disclosed hereabove for 
the isolation of the fragments which were then included in 
the plasmids referred to hereabove and which were then 

30 used for the ON A sequencing can be used. 

Of course -ot.her methods can be used. Some of them 

Kave been ^ffju inp lir^ the earlier British Application. 

roro / re* n e e is, for instance, made to the following methods. 
-1 J 

a) DNA can be transfected into mammalian cells 

35 with appropriate selection markers by a variety of tec- 


hniques. ^fiCchjum phosphate precipitation. polyethylene 


1 4 

glycol, protoplast-fusion, etc.. 

b) ONA fragments corresponding to genes can be 
cloned into expression vectors for £. coli , yeast/ or 
mammalian cells and the resultant proteins purified. 

c) The provival ONA can be "shot-gunned" (frag- 
mented) ' into procaryotic expression vectors to generate 
fusion polypeptides. Recombinants producing antigenically 
competent fusion proteins can be identified by simply 
screening the recombinants with antibodies against LAV 
antigens . 

The invention also relates more specifically to 
cloned probes which can be made starting from any ONA 
fragment according to this invention, thus to recombinant 
ONAs containing such fragments, particularly any plasmids 
amplifiable in procaryotic or eucaryotic cells and carry- 
ing said fragments. 

Using the cloned ONA fragments as a molecular hy- 
bridization probe - either by marking with radionucleo- 
tides or with fluorescent reagents - LAV virion RNA may be 
detected directly in the blood, body fluids and blood 
products (e.g. of the ^iCntVr(^^^h y 1 i? factors/ such as 
Factor VIII concentrates) and vaccines, i.e. hepatitis 8 
vaccine. It has already been shown that whole virus can be 
detected in culture supernatants of LAV producing cells. A 
suitable method for achieving that detection comprises 
immobilizing virus onto «s-a-rd a support, e.g. nitrocellulose 
filters, etc., disrupting the virion, and hybridizing with 
labelled ( rad iola belled or "cold" fluorescent- or 
enzyme-labelled) probes. Such an approach has already been 
developed for Hepatitis 8 virus in peripheral blood 
(according to SCOTTO J. et al. Hepatology (1983). 3. 
379-384 ) . 

Probes according to the invention can also be used 
for rapid screening of genomic ONA derived from the tissue 
of patients with LAV related symptoms^ to see if the pro- 
viral ONA or RNA is present in host tissue and other 
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tissues .. 

, A method which can be used for such screening 
xxtmpffiffft^he following steps : extraction of ONA from tis- 
sue, restriction enzyme cleavage of said ONA, electro- 
phoresis of the fragments and Southern blotting of genomic 


ONA from tis sues .^sljf) sequent hybridization with labelled 
cloned LAV provival ONA. Hybridization jji situ can also be 
used. 

Lymphatic fluids and tissues and other non-lympha- 
10 tic tissues of humans, primates and other mammalian 
species can also be screened to see if other o woi u t i u m i a 11 9 
related retrovirus exist. The methods referred to here- 
above can be used, although hybridization and washings 
would be done under non -stringent conditions. A 
15 The ONA according to the invention can^be used 

^a-ts-o* for achieving the expression of LAV viral antigens 
for diagnostic purposes. 

The invention also relates to the polypeptides 
themselves which can be expressed by the different ONAs of 
20 the inventions, particularly by the ORFs or fragments 
thereof, in appropriate hosts, particularly procaryotic or 
eucaryoti'c hosts, after transformation thereof with a 
suitable vector previously modified by the corresponding 
ONAs . 

25 These polypeptides can be used as diagnostic 

tools, particularly for the detection of antibodies in 
biological media, particularly in sera or tissues of 
persons afflicted with pre-AIOS or AI0S, or simply 
carrying antibodies in the absence of any apparent 

30 disorders. Conversely, the different peptides according to 
this invention can be used themselves for the production 
of antibodies, preferably monoclonal antibodies specific 
of the different peptides respectively. For the production 
of hybridomas secreting said monoclonal antibodies^ 

35 conventional production and screening methods are used. 
These monoclonal antibodies, which themselves are part of 
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the invention; then provide very useful tools for the 
identification and even determination of relative 
proportions of the different polypeptides or proteins in 
biological samples, particularly human samples containing 
5 LAV or related viruses. 

Thus, all of the above peptides can be used in 
diagnostics as sources of immunogens or antigens free of 
viral particles, produced using non-permissive systems, 
and thus of little or no biohazard risk. 

10 The invention further relates to the hosts (proca- 

ryotic or eucaryotic cells) which are transformed by the 
above-mentioned recombinants and which are capable of 
expressing said DNA fragments. 

Finally^ it also relates to vaccine compositions 

15 whose active principle is to be constituted by any of the 
expressed antigens, i.e. whole antigens, fusion polypep- 
tides or oligopeptides) in association with a suitable 
pJt-arMMCUU Ileal or> physiologically acceptable carrier. 

7 

Preferably, the active principles to be considered 
20 in that f ield , consi s t of the peptides containing less than 
250 ^mTnoa^ e r o ^ jni t s , preferably less than 150 as deducible 
T^S^ the complete LAV, and even more preferably 

those peptides which contain one or more groups selected 
from N-X-S and N-X-T as defined above. Preferred peptides 
25 for use in the production of vaccinating principles are 
peptides (a) to (fJ as defined above. By way of example 
having no limitative character, there may be mentioned 
that suitable dosages of the vaccine compositions are 
those which enable administration to the host, 
30 particularly human host, ranging from 10 to 500 micrograms 
per kg, for instance 50 to 100 micrograms per kg. 

3r the purpose of clarity figs. 19 to 26 are 


35 


added. -r e fdfonee may be made thereto in case of difficul- 
ties of reading blurred parts of figs, u to 12. 
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Needless to say that figs. 19-26 are merely a 

reiteration of the whole ONA sequence of the LAV JjjanxSw^T" 

i 

Finallyj the invention also concerns vectors for 
the transformation of eucaryotic cells of human origin, 
particularly, lymphocytes, the polymerases of which are 
capable of recognizing the LTRs of LAV. Particularly said 
vectors are characterized by the presence of a LAV LTR 
therein, said LTR being then active as a promoter enabling 
the efficient transcription and translation in a suitable 
host of the above defined, jo* a- DNA insert coding for a 
determined protein placed under its controls. 

Needless to say that the invention extends to all 
variants of genomes and corresponding ONA fragments (ORFs) 
having substantially equivalent properties, all of said 
genomes belonging to retroviruses which can be considered 
as equivalents of LAV. 


